Improving Seek Time for Column Store Using MMH Algorithm

نویسندگان

  • Tejaswini Apte
  • Maya Ingle
  • A. K. Goyal
چکیده

Hash based search has, proven excellence on large data warehouses stored in column store. Data distribution has significant impact on hash based search. To reduce impact of data distribution, we have proposed Memory Managed Hash (MMH) algorithm that uses shift XOR group for Queries and Transactions in column store. Our experiments show that MMH improves read and write throughput by 22% for TPC-H distribution. KeywordsLoad; Selectivity; Seek; TPC-H; Algorithms; Hash.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Cost-Aware Strategy for Merging Differential Stores in Column-Oriented In-Memory DBMS

Fast execution of analytical and transactional queries in column-oriented in-memory DBMS is achieved by combining a readoptimized data store with a write-optimized differential store. To maintain high read performance, both structures must be merged from time to time. In this paper we describe a new merge algorithm that applies full and partial merge operations based on their costs and improvem...

متن کامل

Using Fuzzy Cognitive Maps for Prediction of Knowledge Worker Productivity Based on Real Coded Genetic Algorithm

  Improving knowledge worker productivity has been one of the most important tasks of the century. However, we have few measures or management interventions to make such improvement possible, and it is difficult to identify patterns that should be followed by knowledge workers because systems and processes in an organization are often regarded as a death blow to creativity. In this paper, we se...

متن کامل

Implementing K - means Algorithm using Row store and Column store databases : A case study

K-means Clustering is an important algorithm for identifying the structure in data. K-means is the simplest clustering algorithm [8]. This algorithm uses as input a predefined number of clusters i.e., the K from its name. Mean stands for an average, an average location of all the members of a particular cluster. In this work, a novel approach to seeding the clusters with a latent data structure...

متن کامل

Rejection of the Feed-Flow Disturbances in a Multi-Component Distillation Column Using a Multiple Neural Network Model-Predictive Controller

This article deals with the issues associated with developing a new design methodology for the nonlinear model-predictive control (MPC) of a chemical plant. A combination of multiple neural networks is selected and used to model a nonlinear multi-input multi-output (MIMO) process with time delays.  An optimization procedure for a neural MPC algorithm based on this model is then developed. T...

متن کامل

Applied Ergonomics - rok 2010 , ročník 41

Youth and adolescents are routinely engaged in manual material handling (MMH) tasks that may exceed their strength capability to perform the task and may place them at excessive risk for musculoskeletal disorders. This paper reports on a two-dimensional biomechanical model that was developed to assess MMH tasks performed by youth 3–21 years of age. The model uses age, gender, posture of the you...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1204.1598  شماره 

صفحات  -

تاریخ انتشار 2012